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A polynomial transformation for non-Hermitian matrices is presented, which provides access to wedge-shaped 
spectral windows. For Wilson-Dirac type matrices this procedure not only allows the determination of the 
physically interesting low-lying eigenmodes but also provides a substantial acceleration of the eigenmode algorithm 
employed. 



1. Introduction 

The spectra of Wilson-Dirac type matrices M 
have an elliptic shape, with the real parts of the 
eigenvalues being positive. Since the large eigen- 
modes correspond to the doublers, only the small 
ones should contain physics. 

Employing the Arnoldi algorithm the 
straightforward approach to the described eigen- 
problem is to ask the algorithm for the eigen- 
modes of smallest modulus. By introducing shifts 
along the real axis {M + al), the part of the spec- 
trum to be found by the algorithm can be influ- 
enced. 

On the other hand we observe that the Arnoldi 
algorithm converges much faster when asking for 
eigenmodes of largest modulus or largest real part 
of (— M -I- (J I) instead, where on a quenched 4^ 
(8^) lattice a gain factor of about 2 (5) in CPU 
time can be achieved. The disadvantage of this 
approach is that the computed eigenmodes are 
not as 'low-lying' as before (in general their imagi- 
nary parts are larger). This problem can be cured 
by a modified approach, which will be described 
in the following. 



2. The polynomial transformation p^.n 

In order to focus the search for large eigen- 
modes of (— M -I- al) on the ones close to the 
real axis (i.e. on low- lying eigenmodes of M), we 
propose to determine the eigenmodes of largest 



real parts of 

PaAD) - [D + alf , (1) 

with D defined through M = I — kD. Using D is 
only a matter of notation, one could as well work 
with p{AI) = {-M + cr/)".[| 

For the standard Wilson-Dirac matrix, D can 
be even-odd preconditioned. The polynomial 
transformation then takes the form 

where e and o stand for even and odd respectively. 

The spectral windows amenable to po-,n can 
be identified by determining those complex num- 
bers, a, which are mapped into 5* = {a G 
C I real(pcr,Ti(a)) > c}, where c depends on the 
number of eigenmodes to be determined. This 
is illustrated in Fig. (|]). The curves enclose 
the complex numbers belonging to S, with their 
shape being fixed by a and n. With respect to the 
eigenmode algorithm, S represents the regions of 
sensitivity, i.e. the algorithm is capable to find 
eigenmodes lying inside the wedge-like regions S. 

3. Results 

The spectral windows of the standard and the 
clover improved Wilson-Dirac matrix amenable to 
Pa,n are displayed in Fig. (|). 

The left frame shows that the computed eigen- 
modes are indeed enclosed by the curves defined 

^ For nolynomial acceleration of eigenmode algorithms, see 
e.g. i 
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Figure 1. Spectral windows of sensitivity. Left: 
without preconditioning, for n = 16, cr = 30 and 
c = (cr + 3)", right: with even-odd precondition- 
ing, for n = 8, = 30 and c = (ct + 3)". 



through S and that they are close to the real axis, 
i.e. low-lying. 

Furthermore the second frame illustrates that 
with too large an exponent n, physically uninter- 
esting eigenmodes with large imaginary parts will 
be computed by the algorithm. 

Finally the right frame demonstrates that pre- 
conditioning can help to substantially reduce n 
without changing the interesting window of sen- 
sitivity, as the curves cutting the real axis nearly 
coincide. 

In Fig. further such examples are given. 
The left frame shows 200 eigenmodes as found on 
a quenched 8"* lattice. The right frame compares, 
for a full QCD 16^ x 32 lattice, the eigenmodes 
found for the exponents n — 1 and n = 24. The 
plotted curves enclose the regions of the complex 
plane that were searched for eigenvalues, i.e. in- 
side the curves all eigenvalues are captured. 

One might suspect that the polynomial trans- 
formation Pa.n{D) goes along with a polynomial 
increase of the execution time due to additional 
matrix vector multiplications. That this is not 
so can be seen for the even-odd preconditioned 
matrix in Fig. (^) and for clover improvement 
in Fig. (|^), where the dependence of the number 
of matrix vector multiplications and of the CPU 
time on the exponent n are displayed.^ 

For the even-odd preconditioned Wilson-Dirac 
matrix one can see that the run time increases 
strongly on the 4^ and is almost stable on the 8^ 
lattice, whereas a dramatic decrease is found on 
the realistic lattice sizes 12'^ x 24 and 16'^ x 32. For 
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Figure 2. 100 eigenvalues (fat points), determined 
on a quenched 4^ lattice for (3 = 5.0. Left: Pa,n 
with cr = 30 and n = 8, even-odd precondi- 



tioned. Middle: 



Pa 



with (T = 30 and n — 16, 



even-odd preconditioned. Right: Comparison of 
P30,46 without even-odd preconditioning (dotted 
lines) with P3o,8 for even-odd preconditioned ma- 
trix (full lines). 
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^Surprisingly in some cases only for n = 4 no convergence 
was found. 



Figure 3. Left: 200 eigenvalues on a quenched 8^ 
lattice, for (3 = 6.0, a = 50 and n = 8. Right: 
50 eigenvalues on a full QCD 16"^ x 32 lattice, for 
P = 5.6 and cr — 50. The curve corresponds to 
n = 24. 



the last two lattices the CRAY T3E-1200 found 
an acceleration factor of 8, whereas the CRAY 
T3E-600 produced a gain factor of 9 and 14 re- 
spectively. 

For the even-odd preconditioned Wilson-Dirac 
matrix this systematics seems to be delayed. On 
the 8^ lattice an increase in computing time is 
detected, whereas on the 12^ x 24 an acceleration 
factor of 1.6 was found. 

Furthermore the results indicate that the factor 
of acceleration grows with the size of the lattice. 

Why does the polynomial transformation accel- 
erate the algorithm? 

At first one might expect that through the 
polynomial transformation the density of the 
eigenvalues is decreased. This would explain the 
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Figure 4. Dependence of the number of ma- 
trix vector multiplications (upper frame) and the 
CPU time (lower frame) on the exponent n (nor- 
malized such that for n = 1 all values are equal 
to 1). 50 eigenmodes of the even-odd precondi- 
tioned Wilson-Dirac matrix were determined with 
cr = 50. The 4^, 8^ and 12^ x 24 lattices are 
quenched. 

acceleration, since the convergence is sensitive to 
this quantity. But if that is the right explana- 
tion, the number of matrix vector multiplications 
would be decreasing as well. Since this is not al- 
ways the case (see Fig. (§ and Fig. (|)), a differ- 
ent mechanism must be at work. Taking a look at 
the time spent in the Arnoldi subroutines for dif- 
ferent exponents n immediately provides the an- 
swer: When employing the polynomial Pa,n one 
has to enter the Arnoldi subroutines less often, 
since in one Arnoldi step one performs n mul- 
tiplications instead of only one and thus saves 
the time normally spent in those routines. This 
means that for large n, the computing time is al- 
most exclusively used for the matrix vector mul- 
tiplications. Hence the optimal choice of polyno- 
mial is met, if just one Arnoldi factorization has 
to be performed. 
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Figure 5. Dependence of the number of matrix 
vector multiplications (left) and the CPU time 
(right) on the exponent n (normalized such that 
for n = 1 all values are equal to 1). 50 eigen- 
modes of the clover improved Wilson-Dirac ma- 
trix were determined with a = 50. Both lattices 
are quenched. 

4. Summary and comments 

It could be shown that through the polynomial 
transformation Pcr,n wedge-shaped spectral win- 
dows are accessible. In case of the even-odd pre- 
conditioned Wilson-Dirac matrix this led to an 
acceleration factor of an order of magnitude on 
typical lattices sizes, whereas for the clover im- 
proved Wilson-Dirac matrix a factor of the order 
of 2 was found. It could be shown that this fac- 
tor grows with the size of the lattice. Comparing 
with a search for the eigenmodes of smallest mod- 
ulus of A/, which provides a similar part of the 
spectrum as Pcr.n {D) , the factor of acceleration is 
even larger. 

The possible accuracy with which one can de- 
termine the eigenmodes is not affected by the 
polynomial transformation. 
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